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1 Introduction 

A polynomial threshold function (PTF) is a function of the form f(X) = 
sgn(p(X)) for some polynomial p(X). We say that / is a degree-d polynomial 
threshold function of p is of degree at most d. Polynomial threshold functions 
are a fundamental class of functions with applications to many fields such as 
circuit complexity [T], communication complexity [9 and learning theory [BJ. 

We discuss the issue of pseudo-random generators for polynomial threshold 
functions of bounded degree. Namely for some known probability distribution 
T> on K n , we would like to find an explicit, easily computable function G : 
{0, 1} S — > M™ so that for any degree-d polynomial threshold function, /, 

\E Y ~v[f(Y)] - E Xeu{0A}S [f(G(X))}\ < e. 

There are two natural distributions, T>, to study for this problem. The first is 
that of the hypercube distribution, namely the uniform distribution over {0, 1}™. 
The second is the Gaussian distribution. The latter can often be thought of as 
a special case of the former. In particular for polynomials of low influence (for 
which no one variable has significant control over the size of the polynomial), 
the invariance principle says that these polynomials behave similarly on the two 
distributions. In fact many results about the hypercube distribution are proven 
by using the invariance principle to reduce to the Gaussian case where symmetry 
and the continuous nature of the random variables make things considerably 
easier. 

In this paper we construct an explicit PRG for the Gaussian case. In par- 
ticular, for any real numbers c, e > and integer d > we construct a PRG 
fooling degree-d PTFs of Gaussians to within e of seed length log(n)2 0c ( d )e~ 4 ~ c . 
In particular we show that 

Theorem 1. Let c > be a constant. For e > 0, and d a positive integer, 
let N = 2 n ^e- A - c and k = Q c (d) be integers. Let X, 1 < i < N be in- 
dependently sampled from k-independent families of standard Gaussians. Let 
X = Y^i—i Xi. Let Y be a fully independent family of standard Gaussians. 
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Then for any degree-d polynomial threshold function, f , 

\E[f(X)]-E[f(Y)]\<e. 

From this we can construct an efficient PRG for PTFs of Gaussians. In 
particular 

Corollary 2. For every c > 0, there exists a PRG that e-fools degree-d PTFs 
of Gaussians with seed length 

log(n)2°^e- 4 - c . 

Much of the previous work in constructing pseudo-random generators in- 
volves the use of functions of limited independence. It was shown in [3] that 
<9(e~ 2 )-independence fools degree-1 PTFs. The degree-2 case was later dealt 
with in 4 j, in which it was shown that 0(e _9 )-independence sufficed (and that 
(3(e~ 8 ) sufficed for Gaussians). The author showed that limited independence 
suffices to fool arbitrary degree PTFs of Gaussians in [5], but the amount of 
independence required was Od{e~ 2 ° { ). In terms of PRGs that do not rely 
solely on limited independence, [5] found a PRG for degree-d PTFs on the hy- 
percube distribution of size log(n)2°^e~ 8d - 3 . Hence for d more than constant 
sized, and e less than some constant, our PRG will always beat the other known 
examples. 

The basic idea of the proof of Theorem Q] will be to show that X fools 
a function g which is a smooth approximation of /. This is done using the 
replacement method. In particular, we replace the Xi by fully independent 
families of Gaussians one at a time and show that at each step a small error is 
introduced. This is done by replacing g by its Taylor expansion and noting that 
small degree moments of the Xi are identical to the corresponding moments of 
a fully independent family. 

Naively, if / = sgn(p(a;)), we might try to let g — p(p(x)) for some smooth 
function p so that p(x) = sgn(cc) for |x| > 8. If we Taylor expand g to order T— 1, 
we find that the error in replacing Xi by a fully random Gaussian is roughly 
the size of the T th derivative of g times the T th moment of p(X) —p(X'), where 
X' is the new random variable we get after replacing Xi. We expect the former 
to be roughly S~ T and the latter to be roughly \p\^N~ T / 2 . Hence, for this to 
work we will need N ( 2 <5 1 ) 2 • On the other hand, for g to be a good 
approximation of /, we will need that the probability that |p(V)| < 8 to be 
small. Using standard anti-concentration bounds, this requires |p|2^ _1 to be 
roughly e~ d , and hence N will be required to be at least e~ 2d . 

In order to fix this, we use a better notion of anti-concentration. Our 
underlying heuristic is that for any polynomial p it should be the case that 
|p(X)| < e[p'(X)| with probability not much bigger than e. This should hold 
because changing the value of X by e should adjust the value of p(X) by roughly 
e|p'(X)|. This allows us to state a strong version of anti-concentration. In par- 
ticular with probability roughly 1 — e it should hold that 

\p(X)\ > e\p'(X)\ > e 2 \p"(X)\ >...> eV°P0|. (1) 
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So \p(X)\ > e d \p^(X)\. It should be noted that \p (d \X)\ is independent of 
X and can be thought of as a rough approximation to \p\2- In our analysis 
we will use a g that is a function not only of p(X), but also of |p( m )(X)| for 

1 < m < d. Instead of forcing g to be a good approximation to / whenever 
b(^)| > e d , we will only require it to be a good approximation to / at X where 
Equation [TJ holds. This gives us significant leeway since although the derivative 
to g with respect to p(X) will still be large at places, this will only happen when 
|p'(X)| is small, and this in turn will imply that the variance in p{X) achieved 
by replacing Xi is comparably small. 

In Section [3J we will review some basic properties of polynomial threshold 
functions. In Section [31 we in introduce the notion of the derivative (which we 
call the noisy derivative) that will be useful for our purposes. We then prove 
a number of Lemmas about this derivative and in particular prove a rigorous 
version of Equation [TJ In Section 01 we discuss some averaging operators that 
will be useful in analyzing what happens when one of the Xj is changed. In 
Section [5J we use these results and the above ideas to prove Theorem [TJ In 
Section [5J we use this result to prove Corollary [21 

2 Definitions and Basic Properties 

We are concerned with polynomial threshold functions, so for completeness we 
give a definition 

Definition. A degree-d polynomial threshold function (PTF) is a junction of 
the form 

f = sgn(p(x)) 

where p is a polynomial of degree at most d. 

We are also concerned with the idea of fooling functions so we define 

Definition. Let f : R™ — > R we say that a random variable X with values in 
R n e-fools f if 

\E[f(X)]-E[f(Y)]\<e 
where Y is a standard n-dimensional Gaussian. 

For convenience we define the notation: 
Definition. We use 

B 

to denote that 

\A-B\=0(e). 
For a function on R™ we define its L k norm by 
Definition. For p : R™ — > R define 

\p\ k = E x [\p{X)\ k Y' k . 
Where the above expectation is over X a standard n-dimensional Gaussian. 
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We will make use of the hypercontractive inequality. The proof follows from 
Theorem 2 of 0. 

Lemma 3. If p is a degree-d polynomial and t > 2, then 



\p\t < Vt^i \p\ 2 . 

In particular this implies the following Corollary: 

Corollary 4. Let p be a degree-d polynomial in n variables. Let X be a family 
of standard Gaussians. Then 

Pr(\p(X)\>\p\ 2 /2)>9- d /2. 

Proof. This follows immediately from the PaleyZygmund inequality applied to 
ir. □ 

We also obtain: 

Corollary 5. Let p be a degree-d polynomial, t > 1 a real number, then 

\p\t < 2°*W>|p| 1 

Proof. By Lemma [3l it suffices to prove this for t = 2. This in turn follows from 
CorollarySl which implies that > 9 _d /4|p| 2 . □ 

And 

Corollary 6. If p{X, Y), q(X, Y) are degree-d polynomials in standard Gaus- 
sians X and Y then 

PrY{\p{X,Y)\ 2 , x < e\q{X,Y)\ 2 , x )<A-$ d Pr x , Y (\p{X,Y)\<A-Z d e\q(X,Y)\). 

1/2 

Where \r(X,Y)\ 2 ,x denotes the L 2 norm over X, namely (E x [r(X,Y) 2 ]) ' . 

Proof. Given Y so that \p(X,Y)\ 2 , x < e\q(X,Y)\ 2 ,x, by Corollary H we have 
that \q(X,Y)\ > \q(X,Y)\ 2}X /2 with probability at least 9~ d /2. Furthermore, 
\p(X,Y)\ < 2 ■ 3 d \p{X,Y)\ 2 \ x with probability at least 1 - 9^/4. Hence with 
probability at least 9 _d /4 we have that 

\p(X, Y)\<2- 3 d \p(X, Y)\ 2<x < 2 ■ 3 d e\q(X, Y)\ 2 . x < 4 ■ 3 d e\q(X, Y)\. 

So 

Pt x , y (\ P (X,Y)\ <4-3 d e\q(X,Y)\) > ^Pr Y (\p(X, Y)\ 2>x < e\q(X, Y)\ 2 , x ). 

□ 
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3 Noisy Derivatives 



In this section we define our notion of the noisy derivative and obtain some of 
its basic properties. 

Definition. Let X and Y be n- dimensional vectors and 9 a real number. Then 
we let 

N Y (X) := cos(9)X + sm(9)Y. 

If X and Y are independent Gaussians, Y can be thought of as a noisy 
version of X with 6 a noise parameter. We next define the noisy derivative. 

Definition. Let X, Y, Z be n-dimensional vectors, f : R™ — > M a function, and 
9 a real number. We define the noisy derivative of / at X with parameter 9 in 
directions Y and Z to be 

n e fm . f{N$(X))-f(N° z (X)) 

U Y.Z3\ X ) ■— g • 

It should be noted that if X, Y, Z are held constant and 9 goes to 0, the noisy 
derivative approaches the difference of the directional derivatives of / at X in 
the directions Y and Z. The noisy derivative for positive 9 can be thought of as 
sort of a large scale derivative that covers slightly more than just a differential 
distance. 

We also require a notion of the average size of the I th derivative. In particular 
we define: 

Definition. For X an n-dimensional vector, f : R" — >• M a function, I a non- 
negative integer, and 9 a real number, we define 

|jf := Ey uZl _ Ye , Ze [\D^ z D^ Z2 ■ ■ ■ D^ Ze f(X)\% 

where the expectation is taken over independent, standard Gaussians Yi,Zi. 
Lemma 7. For p a degree-d polynomial, and 9 a real number 

\p { e\x)\2 

is independent of X . 

Proof. This follows from the fact that for fixed Y i} Zi that 

is independent of X. This in turn follows from the fact that for any degree-cZ 
polynomial q and any Y,Z, Dy z q{X) is a degrcc-(c? — 1) polynomial in X. □ 

We now prove our version of the statement that the value of a polynomial 
is probably not too much smaller than its derivative. 
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Proposition 8. Let e, 9 > be real numbers with 9 = 0(e). Let p be a degree-d 
polynomial and let X, Y, Z be standard independent Gaussians. Then 

Pr x . Y ,z(\p(X)\ < e\D e YjZ p{X)\) = 0(d 2 e). 

To prove this we use the following Lemma: 

Lemma 9. Let e,9,p,d 7 X 7 Y be as above. Then 

Pr x , Y (\p{X)\ < e -\p(X)-p(N^X))\)=0{d>e). 

Proof. The basic idea of the proof will be the averaging argument discussed in 
the Introduction to this Part. We note that the above probability should be the 
same for any independent Gaussians, X and Y. We let X^ := Ny(X). Note that 
Xcf, and X ( p +7T f 2 arc independent of each other. Furthermore, N x ^ + ^ (X^) = 
X^+e. Hence for any 4>, the above probability equals 



Pr x , y (|p(J^)| < - e \p{X^)-p{X^e)\) 



We claim that for any values of X and Y, that the average value over 4> & [0, 2tt] 
of the above is 0(d 2 e). 

Notice that with X and Y fixed, piX^) is a degree-ci polynomial in cos(0) 
and sin(0). Letting z — e 1 ^ we have that cos(</>) = z+z — , sin(0) : 
Hence p(X^) = z~ d q(z) for some polynomial q of degree at most 2d. 

We wish to bound the probability that 



2z 



< 



- d q{z) 



^ — d^—id9 



q(ze i8 )\ \q(z) 



-id9 



q(ze ie )\ 



Mz)\ 



\Q(z)\ 



For - sufficiently small, we may instead bound the probability that 



log 



9 I i 

7 q(ze l 



> 



2e 



On the other hand, we may factor q(z) as aH-f^z — ^) where rj are the roots 
of q. The left hand side of the above is then at most 



2d 



d9 + J2 



lot! 



<d9 + J2° 

»=i 

2d 

<d9 + 9^0 



Hence it suffices to bound the probability that 

2d 



t! \\z-ri\J 2e 



G 



If 4e > d , there is nothing to prove. Otherwise, the above holds only if 

2d ( 1 \ 1 

This in turn only occurs when z is within 0(de) of some rj. For each ri this 
happens with probability O(de) over <p, and hence by the union bound, the 
above holds with probability 0(d 2 e). □ 

Note that a tighter analysis could be used to prove the bound 0(d\og(d)e), 
but we will not need this stronger result. 

Proposition [5] now follows immediately by noting that |p(A)| is less than 
e\D^ z p(X)\ only when either \p(X)\/2 < % \p(X) - p(N Y (X))| or \p(X)\/2 < 
||p(A) — p(N z (X))\. This allows us to prove our version of EquationQ] 

Corollary 10. For p a degree-d polynomial, X a standard Gaussian, e, 9 > 
with 9 = 0(e), and £ a non-negative integer, 

Pr x (\pf\x)\ 2 <e\pf +1 \x)\ 2 )<2°^e. 

Proof. Let Yi,Zi be standard Gaussians independent of each other and of X 
for 1 < % < I + 1. Applying Proposition [5] to D^ Z2 ■ ■ ■ D Y(+iZe+i p(X), we find 
that 

Prx, Yl , Zl (\D e Y2A ■ ■ ■ D e Ye+uZe+i p(X)\ < 4 • 3 d e\D e YuZi ■ ■ ■ D e Ye+uZe+i p(X)\) 
= 0(d 2 3 d e). 

Noting that 

\pf ] (X)\ 2 = \D e Y2 , Z2 ■■■D^ t+1 , Zt+1 p(X)\ 2t0 r uZl ,...,Y t+l ,z t+1 ) 

and 

\pf +1 \X)\ 2 = K !Zl ---^ +1 ^ +lP (^)|2,(^ 1 ,...^ +1 , Zm)5 

Corollary |6] tells us that 

Pr x (|pf (X)\ 2 < e\$ +1 \x)\ 2 ) = 0(d 2 27 d e). 

□ 

4 Averaging Operators 

A key ingredient of our proof will be to show that if we replace X by X' by 
replacing one of the Aj by a random Gaussian, that the variance of |p^(A')| 
is bounded in terms of (A") | (see Proposition [T^)) . Unfortunately, for our 

argument to work nicely we would want it bounded in terms of the expectation 
of |p^ +1 )(A')|. In order to deal with this issue, we will need to study the 
behavior of the expectation of q(X') for polynomials q, and in particular when 
it is close to q{X). To get started on this project, we define the following 
averaging operator: 
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Definition. Let X be an n- dimensional vector, f : M" — > K a function, and 9 
a real number. Define 

A e f(X) := E Y [f(N Y (X))] . 
Where the expectation is over Y a standard Gaussian. 

Note that the A form the Ornstein-Uhlenbeck semigroup with A 8 = T t 
where cos(6>) = e _t . The composition law becomes T tl T t2 = T tl +t 2 - We express 
this operator in terms of 9 rather than t, since it fits in with our N s notation, 
which makes it more convenient for our purposes. 

We also define averaged versions of our derivatives 

Definition. For p a polynomial, £, m non-negative integers, X a vector, and 9 
a real number let 



\p^ m (X)\l:=E Yl ,..., Ym \\pf\N^--N^X)\l\=(A s r\pf\x)\ 



We claim that for X a standard Gaussian, that with fairly high probability 
\p { ^ m {X)\ is close to \p [ e\x)\. In particular: 

Lemma 11. If p is a degree-d polynomial, and I and m are non-negative inte- 
gers, X a standard Gaussian, and e, 9 > 0, then 

Prx (\\ P f hm+1 (X)\l - \p { e ehm (X)\l\ > e\pf' m {X)\t) < 2°W^" 1 . 

Proof. If \\p ( e ehm+1 {X)\l - |p^ m (X)|!| > e\p^' m (X)\l then by Corollary H 
with probability 2°^ over a standard normal Y we have that 

\\p^' m (N Y (X))\i-\pf^(X)\l\ > e\p^ m (X)\l 

By Lemma IH1 this happens with probability 0(d 2 9e~ 1 ), and hence our original 
event happens with probability at most 2°^ d >9er 1 . □ 

We bound the variance of these polynomials as X changes. 

Proposition 12. Let p be a degree-d polynomial, and 9 > 0, £,m non-negative 
integers, then for X any vector and Y a standard Gaussian 



Vary 



\p^ m (N Y X)\i\ < 2°^^\p^ m {X)\l\p^ m+1 {X)\l. 



We begin by proving a Lemma: 

Lemma 13. Let q{X, Y) be a degree-d polynomial and let X , Y and Z be 
independent standard Gaussians. Then 

Vary [E x [q(X,Y) 2 ]] < 2° {d) E x , YZ [(q(X, Y) - q(X, Z)f] E X , Y [q(X,Y) 2 ] . 
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Proof. Recall that if A,B are i.i.d. random variables, then Var[A] = ^E[(^4 
B) 2 ]. We have that 

Vary [E x [q(X,Y) 2 ]] = ~E Y<Z [(V x [q(X,Y) 2 } - E x [q(X, Z) 2 })' 

< 2°^E YZ [\E X [q(X, Y) 2 - q(X, Z) 2 ] |] 2 

< 2 0{d) E x ^ z [\q(X, Y) - q(X, Z)\\q(X, Y) + q(X, Z)f 

< 2°^E X . Y , Z [{q{X, Y) - q(X, Z)) 2 } E X , YZ [{q{X, Y) + q{X, Z)) 2 } 

< 2°^E X , Y>Z [(q(X, Y) - q(X, Z)) 2 ] E X , Y [q(X, Y) 2 } . 

The second line above is due to Corollary O The fourth line is due to Cauchy- 
Schwarz. □ 

Proof of Proposition ["Tgl For fixed X, consider the polynomial 

q((Y 2 ,Z 2 , ...,Y t+l ,Z l+l ,W u ..., W m ),Y) 
=■■ D 6 Y2 , Z2 ■ ■ ■ D e Ye+uZf+i p(N e Wi ■ ■ ■ N e Wm N Y (X)). 

Let V = (Y 2 , Z 2 , . . . ,Y e+1 , Z e+1 ,W!, . . . ,W m ). Notice that 

\ P ^ m (N Y X)\ 2 2 =E v [q(V,Y) 2 ], 
e\p^ m {X)\ 2 = E v>YjZ [(q(V,Y) - q{V,Z)) 2 } , 
\pf ] ' m+ \X)\l=Ey, Y [q(V,Y) 2 ]. 

Our result follows immediately upon applying the above Lemma to q(V, Y). □ 

We also prove a relation between the higher averages 

Lemma 14. Let d be an integer and 9 = 0(d^ 1 ) a real number. Then there 
exist constants cq, ■ ■ . , c 2 d+\ with \c m \ — 2°^ and Y^m=o ° m = ^ so that for 
any degree-d polynomial p and any vector X , 

2d+l 

c m (A e r P (x) = o. 

m=0 

Proof. First note that (A e ) rn p{X) = A e ™p{X) where cos(6» m ) = cos((9) m . We 
note that it suffices to find such c's so that for any Y 

2d+l 

Y / c m p{N°r{X)) = Q. 

771=0 

Note that cos 2 (6» m ) = 1 - m6 2 + 0(d6 4 ). Hence sin(6» m ) = ^/hl9 + 0(d6 3 ). 
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Recall that once X and Y are fixed, there is a degree-2<i polynomial q so 
that for z m = e ie ™ = 1 + i*Jm~6 + 0(d9 2 ), p{N%T{X)) = z m d q(z m ). Hence we 
just need to pick c m so that for any degree-2d polynomial q we have that 

2d+l 



c mZ m d q{z m ) = 0. 

m— 

Such c exist by standard interpolation results. In particular, it is sufficient to 
pick 

2d+l „ 

Cm = z d m ^(2dy. n ^— 

For this choice of c we have that 

|c ro | = v /(2d)! [] = LI \Tp 1=\) 

For 1 < i < 2m we have that — \/m\ = — m\/y/m). For i > 2m, we 
have that \Vi — \fm\ = Q(\/i). We evaluate |c m | based upon whether or not 
2m>2d + 1. If 2m>2d + 1, then 



2d+l / — \ 

n Kra 

i=\A=tm 



2°^ v / (2d)! 



m d 



2°^Vd^d c 



(m- l)!(2d+l - m)! 

/ 2d \ 

Vm-i; 



(2d!) 



If 2m < 2d+l, 



2m , . s 2d+l / 

-^.n « ^ n « ^ 

8=1, i^m v ' i=2m+l v v 



Cm ~ 



i=l,i/ 

2 0(d) V / (2d)! 



m m / (2m)! 

(m- l)!(m- 1)! V (2d+ 1)! 



2 0(d) m m v /(2m)! 
2 0(d) 



(m- l)!(m- 1)! 
m™7(2m)!( 2 ™- 2 ) 



(2m- 2)! 
2 o(d)_ 



Considering g(X) = 1, we find that X)m=o Cm = 0- ^ 
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Applying this to the polynomial \pg (X)\, we find that 

Corollary 15. Let d be an integer and 9 — 0{d^ 1 ) a sufficiently small real 
number (as a function ofd). Then there exist constants Cq, . . . , c^+i with \c m \ = 
0^(1) and X)m=o c m — so that for any degree-d polynomial p, any vector X , 
and any non-negative integer I, 

id+1 



Furthermore Ylm—o c " 
In particular, 



m=0 

and IcJ = 2°( d ). 



Corollary 16. There exists some absolute constant a, so that if — 0(d 1 ) 
and if 



log 



\p { e e) ' m (X)\l 



}p^ m+ \x)\l 



< a 



for some £ and all 1 < m < 4d, then all of the \Pg^' m {X)\ 2 for that £ and all 

< m < Ad + 1 are within constant multiples of each other. 

Proof. It is clear that \p"p' m {X)\\ are within 1 + 0{da d ) of each other for 

1 < m < 4d + 1. By Corollary [T5l we have that 



id 



\p { e eh °(X)\l = £ 



■-\p^ m {X)\l 



m—l 
id 



CO 



m—l 
id 



CO 



4</ 



= Y J ^\p^\x)\l + \p^\x)\l0\da d Y. 



co 



m—l 



C 71 1 

co 



\p { e h \x)\l(l + 0(d 2 a d 2°W 



Hence for a sufficiently small, |p^- ) ' (J , sr)|| is within a constant multiple of 
\p ( e h \x)\l ' □ 



5 Proof of Theorem [T] 

We now fix c,e,d,N,k,p as in Theorem [1] Namely c, e > 0, d is a positive 
integer, N an integer bigger than B{c) d e~ A ~ c , and fc an integer bigger than 
B(c)d for -B(c) some sufficiently large number depending only on c, and p a 

degree-d polynomial. We fix 9 = arcsin I — h= J ~ B(c) d / 2 e~ 2 ~ c / 2 . 
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Let p : R — > [0, 1] be a smooth function so that p(x) — if x < — 1 and 
p(x) = 1 if x > 0. Let a : R — > [0, 1] a smooth function so that a(x) = 1 if 
\x\ < 1/3 and <r(x) = if |a;| > 1/2. Let a be the constant given in Corollary 

CEH 

For Q < £ < d, < m < 4d + 1, let 9£, m (^) be the degree-2d polynomial 
|p^' m (X)|2. Recall by Lemma [7] that 9d,m is constant. We let g±(X) be 



'<- (± ^-»n>('- («)) 1- (iS^§y)) 



Where I(o.oo)(x) above is the indicator function of the set (0,oo). Namely it is 
1 for x > and otherwise. 

g± approximates the indicator functions of the sets where p(X) is positive 
or negative. To make this intuitive statement useful we prove: 

Lemma 17. The following hold: 

• g ± : R n ->■ [0, 1] 

• g+{X) + g-(X) < 1 for all X 

• ForY a standard Gaussian E Y [l - g+(Y) - g-(Y)} < 2 0( - d) e. 

Proof. The first two statements follow immediately from the definition. The 
third statement follows by noting that g+(Y) + g-(Y) = 1 unless qifi{Y) < 
£ 2 qe+i,o( x ) for some ^ or \lt,m+i{X) - qt tm {X)\ < ft(a d )\qL, n (X)\ for some 
£, m. By Corollary [10] and Lemma 1111 this happens with probability at most 
2 °W e . □ 

We also want to know that the derivatives of g± are relatively small. 

Lemma 18. Consider g± as a function of qi t , n {X) (consider sgn{p{X)) to be 
constant). Then the t th partial derivative of q+, a a — s is at most 

Proof. The bound follows easily after considering g as a function of the log(<7£ jm ). 
Noting that the t th partial derivatives of q in terms of these logs are at most 
Ot(a~ dt ), the result follows easily. □ 

Lemma 19. Given c < 4 ; let X be any vector and let Y and Z be k-independent 
families of Gaussians with k > 512c _1 rf, N = e~ 4 ~ c and 9 = 0(d~ 2 ). Then 

\E Y [g+(N°r(X))] - E Z [g+(N e z (X))] \ < 2°^eN-\ 

And the analogous statement holds for g_ . 
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Proof. First note that e 2 9 = 0(e c/2 ), and hence that (e 2 9) 16 / c = 0(eN- r ). Let 
T be an even integer between 32/c and 64/ c. 

First we deal with the case where | log(% im (X) / q^ m+ \[X))\ > a d for some 

< £ < d, 1 < to < Ad, or <ft,i(X) < e 2 / 10 <^+iaPO for some We claim that 
in either case the E [g + (N Y (X))] = Odi^N^ 1 ) and a similar bound holds for 
Z. If there is such an occurrence, find one with the largest possible £ and of the 
second type if possible for the same value of I. 

Suppose that we had an occurrence of the first type. Namely that for some 
£, m, 

\log(qe, m (X)/q ttm+1 (X))\> a d . 

Pick such a one with I maximal, and with to minimal for this value of I. Consider 
then the random variables q£ im -i(N Y (X)) and qi tm (N Y (X)). They have means 
qe <m (X) and qe tTn+ i(X), respectively. By Proposition [T2l their variances are 
bounded by 

2°W% +1 , m _ 1 (X)^ >m (X), and 2°^9q e+hm (X)q^ m+1 (X), 

respectively. Since we chose the smallest such £, all of the qe + i, m (X) for 

1 < m < 4d + 1 are close to qt+x t i(X) with multiplicative error at most 
a d . By Corollary [16j this implies that qe+ifi(X) is also close. Hence, the 
variances of q^ m ^ 1 {N Y {X)) and q^ m (N Y (Xj) are O d {9q l+1 ^{X)q^ m (X)) and 
Od(0qe+i,i(X)qe ym+ i(X)). Since there was no smaller to to choose, qe, m (X) is 
within a constant multiple of qi t %(X). Since we could not have picked an occur- 
rence of the second type with the same £, we have that qi t \ (X) > e 2 qt+i t i(X)/10. 
Hence both of these variances are at most 

2°^e- 2 em^(q^ m (X),q ltm+1 (X)) 2 ). 

Hence by Corollary [5j for either of the random variables Qi = q£ t7n (N Y (X)), 
or Q 2 = qe, m+ i(N Y (X)) with means the T th moment of \Q+ — fa\ (using 

the fact that Y is at least 4Td-independent) is at most 2° c ^eN~ 1 max(/Xi) T . 
Hence, with probability at least 1 — 2° c ^eN^ 1 , \Qi — m\ < a<1 max(/Xj)/10. But 
if this is the case, then | log(Qi/Q2)| will be more than a d /2, and g+ will be 0. 

Suppose that we had an occurrence of the second type for some I. Again by 
Corollary [TBI we have that qe+i^(X) is within a constant multiple of qe+i t i(X) 
and qi+2,o{X) within a constant multiple of q£ + 2,i(X). Let Qo be the random 
variable qtfi(N e (Y)) and Qi the variable qe+ifl(N s (Y)). We note that they have 
means equal to qt t x(X) and qt+i t i(X), respectively. By Proposition 1121 their 
variances are at most 2 0( ^9qi +1A (X)q eA (X) and 2°^9q i+2l i(X)q i+ i )1 (X). 
Since we had an occurrence at this I but not the larger one, these are at 
most 2 0( -^9e 2 q l+1A (X) 2 and 2°^9e- 2 q l+1 ^{X) 2 , respectively. Considering the 
rpth momen t f q 1 minus its mean, /ii, we find that with probability at least 
1 _ 2 -( d ) e N- 1 that |Qi - qt+i,i\ < &+i,i/ 20 - Considering the T th moment of 
Qo minus its mean, we find that with probability at least 1 — 2 0c ^eN^ 1 that 
\Qo — qe,i\ < e 2 qi+is/20. Together these imply that log(Qo/(e 2 Qi)) < —1 and 
hence that g + (N Y (X)) = 0. 
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Finally, we assume that neither of these cases occur. We note by Corollary 
1161 that for each m and I that qi yTn (X) is within a constant multiple of (X). 
We define Qi, m = qt^miNyiX)) and note by Proposition [T3] that Var(Q^ m ) 
is at most 2°^9e~ 2 E[Qg_ m } 2 . We wish to show that this along with the k- 
independence of Y is enough to determine F,Y[g+(Qi, m )] to within 2° c ^eN~ 1 . 
We do this by approximating g + by its Taylor series to degree T — 1 about 
(E[Qf , m ]). The expectation of the Taylor polynomial is determined by the 4Td- 
independence of Y. We have left to show that the expectation of the Taylor 
error is small. We split this error into cases based on whether Qe, m differs from 
its mean value by more than a constant multiple. 

If no Qi >m varies by this much, the Taylor error is at most the sum over 

sequences £1, . . . ,£t,ttii, ■ ■ ■ ,rwr of n*=i \Qturm — ^[Qtum-iW times an appro- 
priate partial derivative. Note that by Lemma [T5] this derivative has size at 

most Od (j]J =1 }e[q7~ — fj)- Noting that there are at most 2°^ d ' > such terms 

and that we can bound the expectation above as 

\ - / \Qii,m z — E[Q^ 

h\ nQu, ni ]\ ) ■ 

Since T is even, the above is a polynomial in Y of degree 4Td. Since Y is 
4T<i-independent, the expectation of the above is the same as it would be for Y 
fully independent. By Corollary [5] this is at most 

2 Oo(d)J2 f YaT Y[Qei,rm] \ T/2 < 2 Oc(d)(0 e -2)T/2 < 2°^ d) eN^ . 

If some Qi^m differs from its mean by a factor of more than 2, the Taylor 
error is at most 1 plus the size of our original Taylor term. By Cauchy-Schwarz, 
the contribution to the error is at most the square root of the expectation of 
the square of the error term times the square root of the probability that one of 
the Qi. m varies by too much. By an argument similar to the above, the former 
is 2 0c ( d \ To bound the latter, we consider the probability that a particular 
Qi tJn varies by too much. For this to happen Qi tm would need to differ from its 

mean by 2°^ (#e~ 2 ) times its standard deviation. Using Corollary [5] and 
the 8T(i-independence of Y, we bound this by considering the 2T th moment of 
Qe.m minus its mean value, and obtain a probability of 2°°^ (eN^ 1 ) 2 . Hence 
this term produces a total error of 2° c ^ <i ^eA r_1 . 

The argument for g_ is analogous. □ 

Corollary 20. If e, c > 0, X is the random variable described in Theorem [7] 
with N > e~ 4 ~ c . N = 0(d~ 2 ), k > 512c _1 d andY is a fully independent family 
of random Gaussians then 

\E[g + (X)]-E[g + (Y)]\<2°^e. 

The same also holds for g- . 
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Proof. We let Yi be independent random standard Gaussians and let Y — 
VnEI^. We let = feU Y t + £^+1 *i) ■ Note that Z N = Y 
and Z° = X. We claim that 

\E[g + (P)} - E[g + (Z^)]\ = 2°^eN-\ 

from which our result would follow. Let Zj := ^ N :_ 1 f S<=i ^ + -^i) > 

then this is 

\E Zj , Yj [g+(N Y . (^))] - V Zi , Xj [9+(N e Xj (Zj))]\ 

< E Zj [\E Yj [g + (N Yj (Zj))} - E Xj [g + (N e X] (Zj)) 

which by Lemma ITTJ1 is at most 2° a ^eN^ 1 . □ 

We can now prove Theorem [TJ 

Proof. We prove that for X as given with N > e~ 4 ~ c and k > 512c -1 <i, and Y 
fully independent that 

\E[f(X)]-E[f(Y)]\ <2°^e. 

The rest will follow by replacing e by e' = e/2° c ^ . We note that / is sandwiched 
between 2g + — 1 and 1 — 2g_. Now X fools both of these functions to within 
2 c M d ) e . Furthermore by LemmafTTl they have expectations that differ by 2°^e. 
Therefore 

E[f(X)} < E[l - 2g_(X)} « 2 o c( ,) £ E[l - 2g.(Y)} ^ 2 o We E[f(Y)}. 
We also have a similar lower bound. This proves the Theorem. □ 



6 Finite Entropy Version 

The random variable X described in the previous sections, although it does fool 
PTFs, has infinite entropy and hence cannot be used directly to make a PRC 
We fix this by instead using a finite entropy random variable that approximates 
X. In order to make this work, we will need the following Lemma. 

Lemma 21. Let Xi be a k -independent family of Gaussians for 1 < i < N, 
so that the Xi are independent of each other and k,N satisfy the hypothesis 
in Theorem]]} for some c,e,d. Let 6 > 0. Suppose that Z^j are any random 
variables so that for each \Xij — Zij\ < 6 with probability 1 — 5. Then 
the family of random variables Zj — Si=i fools degree d polynomial 
threshold functions up to e + 0(nN5 + d\J nN log(S~ 1 )S 1 / d ). 

The basic idea is that with probability 1 — nN8, we will have that \X^j — 
Zij\ < 6 for all i,j. If that is the case, then for any polynomial p it should be 
the case that p(X) is close to p(Z). In particular, we show: 
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Lemma 22. Let p(X) be a polynomial of degree d in n variables. Let X G 
R™ be a vector with \X\oo < B {B > 1). Let X' be another vector, so that 
(X-X'loo <8<1. Then 

\p(X)-p(X')\ < 5\p\ 2 n d ^O(B) d 

Proof. We begin by writing p in terms of Hcrmitc polynomials. We can write 

P (X) =J2 ai hi{X). 

Here S is a set of size less than n d , hi(X) is a Hermite polynomial of degree d 
and ^2 ieS af — \p\ 2 . The Hermite polynomial hi has 2°^ d > coefficients each of 
size 2°^'. Hence any of its partial derivatives at a point of L°° norm at most B 
is at most 0(B) d . Hence by the intermediate value theorem, \hi(X) — hi(X')\ = 
SO(B) d . Hence \p(X) - p(X')\ < SJ2 ieS \a t \0(B) d . But £ ieS kl < \pkV\S\ 
by Cauchy-Schwarz. Hence 

\p{X)-p{X')\<5\p\ 2 n d ' 2 0{B) d . 

□ 

We also need to know that it is unlikely that changing the value of p by a 
little will change its sign. In particular we have the following anticoncentration 
result, which is an easy consequence of [2] Theorem 8: 

Lemma 23 (Carbery and Wright). Lf p is a degree-d polynomial then 

Pr(\p(X)\<e\p\ 2 ) = 0(de 1 / d ). 

Where the probability is over X , a standard n-dimensional Gaussian. 

We are now ready to prove Lemma 1211 

Proof. Note that with probability 1 — 6 that \X iy j\ = 0(log<5 _1 ). Hence with 
probability 1— 0(nNS) we have that IX^"^ = 0(log(5 _1 ) and ^i^loo < S. 

Let p be a degree d polynomial normalized so that \p\ 2 = 1. We may think of p 
as a function of nN variables rather than just N, by thinking of p(X) instead as 

p J2iLi ■ Applying Lemma[Hl we have therefore that with probability 

1 - 0(nN8) that \p(X) - p(Z)\ < ( 50(V^Vlog(^ 1 )) d . 

We therefore have that if Y is a standard family of Gaussians that 

Pr(p(Z) < 0) < 0(nNS) +Pt( P (X) < 60{VnN log(<5" 1 )) <i ) 

< e + 0(nN5) + Px(p{Y) < 60(Vn~Nlog{S~ 1 )) d ) 

< e + 0{nNS + dVn~N dog^ 1 ^ 1 ' ld ) + Pr(p{Y) < 0). 

The last step above following from Lemma [23l We similarly get a bound in the 
other direction, completing the proof. □ 
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We are now prepared to prove Corollary [5] 

Proof. Given e,c > 0, let k,N be as required in the statement of Theorem 
[TJ We will attempt to produce an effectively computable family of random 
variables Zij so that for some fc-independent families of Gaussians Xi we have 
that \Xij — Zi t j\ < 8 with probability 1 — 8 for each i,j and 8 sufficiently small. 
Our result will then follow from Lemma [2"T1 

Firstly, it is clear that in order to do this we need to understand how to 
actually effectively compute Gaussian random variables. Note that if u and v 
are independent uniform [0, 1] random variables, then yj — 2 log(u) cos(2tt«) is a 
Gaussian. Hence we can let our Xij be given by 



X itj = y-21og(w ij -)cos(27ru ij ), 

where Ui and Vi are ^-independent families of uniform [0, 1] random variables. 
We let u^pv'^j be M-bit approximations to Uij^Vij (i.e. u[ ^ is Uij rounded 
up to the nearest multiple of 2~ M , and similarly for v^-), and let Z^j = 

J— 2\og{u' i j) cos(27TUj i3 -). Note that we can equivalently compute Z^j be let- 
ting itj, v[ be fc-independent families of variables taken uniformly from {2~ M , 2 • 
2~ M , . . . , 1}. Hence, the Zij are effectively computable from a random seed of 
size O(kNM). 

We now need to show that \Xi j — Zi j\ is small with high probability. Let 
a(u,v) = v /-21og(u) cos(27ro). Note that for u, v e [0,1] that \a'\ = 0(l+u~ 1 + 
(1 — m) -1 / 2 ). Therefore, (unless u\ ^ = 1) we have that since X^j — a(ui t j,Vij) 
and Zij — a(ti^,v$ _•), and since \u% l j — u'iA, \vij — v[ ^\ < 2~ , we have that 

\X id - Zij\ = 0(2- M (l + u- 1 + (1 - u)- 1 / 2 )). 

Now letting 8 = 0(2" Af / 2 ), we have that 2" M (1 + w" 1 + (1 - u)- 1 / 2 ) < 8 
with probability more than 1 — 8. Hence for such 8, we can apply Lemma 
|2"T1 and find that Z fools degree d polynomial threshold functions to within 
e + 0(nN8 + dV^ log^ 1 )^). If 8 < e 3d (dnN)~ 3d , then this is 0(e) (since 
for x > d 3d , we have that xlog~ d (x) > x 1 / 3 ). Hence with k = Q c (d),N = 
2n c (d) 6 -4-c an( ^ ^ _ f] c ((ilog((ine _1 )), this gives us a PRG that e- fools degree 
d polynomial threshold functions and has seed length O(kNM). Changing c 
by a bit to absorb the loge -1 into the e _4_c , and absorbing the dlogd into the 
2 o c (d) ; this geed length is log(n)2 ^ d )e- 4 - c . 

□ 
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